Similarity scoring for recognizing repeated out-of-vocabulary words

Authors

  • Mirko Hannemann
  • Stefan Kombrink
  • Martin Karafiát
  • Lukás Burget
Abstract

We develop a similarity measure to detect repeatedly occurring out-of-vocabulary (OOV) words, since these carry important information. Sub-word sequences in the output of a hybrid word/sub-word recognizer are taken as detected OOVs and are aligned to each other with the help of an alignment error model. This model can deal with partial OOV detections and tries to reveal more complex word relations such as compound words. We apply the model to a selection of conversational phone calls to retrieve other examples of the same OOV and to obtain a higher-level description of it, such as being a derivation of a known word.
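The idea of scoring two detected sub-word sequences by aligning them can be illustrated with a minimal sketch. This is not the alignment error model from the paper: it substitutes a plain edit-distance alignment with uniform costs, and the function names, cost parameters, and phone symbols are hypothetical choices made only for illustration.

```python
from typing import Sequence


def alignment_cost(a: Sequence[str], b: Sequence[str],
                   sub_cost: float = 1.0, indel_cost: float = 1.0) -> float:
    """Edit-distance cost of aligning two sub-word (e.g. phone) sequences.

    Assumption: uniform substitution and insertion/deletion costs stand in
    for a trained alignment error model, which would supply its own costs.
    """
    # prev[j] holds the cost of aligning the processed prefix of a with b[:j].
    prev = [j * indel_cost for j in range(len(b) + 1)]
    for i, x in enumerate(a, start=1):
        curr = [i * indel_cost]
        for j, y in enumerate(b, start=1):
            match = prev[j - 1] + (0.0 if x == y else sub_cost)
            delete = prev[j] + indel_cost      # drop a symbol of a
            insert = curr[j - 1] + indel_cost  # add a symbol of b
            curr.append(min(match, delete, insert))
        prev = curr
    return prev[-1]


def similarity(a: Sequence[str], b: Sequence[str]) -> float:
    """Map the alignment cost to a score in [0, 1] (1.0 means identical).

    Normalizing by the longer sequence is valid for the default unit costs.
    """
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0
    return 1.0 - alignment_cost(a, b) / longest


if __name__ == "__main__":
    # Two hypothetical phone sequences that differ in one symbol.
    print(similarity(["k", "ae", "t"], ["k", "ae", "p"]))
```

Candidate detections whose score exceeds a chosen threshold could then be grouped as repetitions of the same OOV. Handling partial detections would need an alignment that does not penalize unmatched sequence ends, which this sketch does not attempt.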

Similar resources

Orthographic Knowledge and Lexical Form Influence Vocabulary Learning.

Many adults struggle with second language acquisition, but learn new native-language words relatively easily. We investigated the role of sublexical native-language patterns on novel word acquisition. Twenty English monolinguals learned 48 novel written words in five repeated testing blocks. Half were orthographically wordlike (e.g., nish, high neighborhood density and high segment/bigram frequ...

Recognition of out-of-vocabulary words with sub-lexical language models

A major source of recognition errors, out-of-vocabulary (OOV) words are also semantically important; recognizing them is, therefore, crucial for understanding. Success, so far, has been modest, even on very constrained tasks. In this paper we present a new approach to unlimited vocabulary speech recognition based on using grapheme-to-phoneme correspondences for sub-lexical modeling of OOV words,...

Psycholinguistic Ambiance of Short Stories in Enhancing Students’ Reading Comprehension and Vocabulary Power

The present study was carried out to investigate the effect of short stories on students’ reading comprehension, vocabulary power and attitude towards the skill and the new instructional materials. The participants of the study were 120 grade 9 students of Dilla Secondary and Preparatory School. In order to gather data for the study, pre- and posttest of reading comprehension, pre and ...

Wordlikeness and Novel Word Learning

Many adults struggle with second language acquisition, but learn new words in their native language relatively easily. Most second language words do not follow native language patterns, but those that do may be easier to learn because they make use of existing language knowledge. Twenty English monolinguals learned to recognize and produce 48 novel written words in five repeated testing blocks....

Publication date: 2010